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DETAILED ACTION 

Response to Amendment 

1 . In response to the office action from 12/3/2008, the applicant has submitted an 
amendment, filed 3/3/2009, amending independent claims 1 and 8-9, while arguing to traverse 
the art rejection based on the limitation regarding second candidate words having a small number 
of phonemes below a preset value and first candidate words having a larger number of phonemes 
equal to or above the preset value (Amendment, Pages 8-9). Applicant's arguments have been 
fully considered, however the previous rejection is maintained, altered with respect to the 
amended claims and due to the reasons listed below in the response to arguments. 

2. In response to the cancellation of claims 2-4, the examiner has withdrawn the previous 
claim objection directed towards improper dependency. 

Response to Arguments 

3. Applicant's arguments have been fully considered but they are not persuasive for the 
following reasons: 

With respect to independent claims 1 and 8-9, the applicant argues that the prior art of 
record, namely Doyle ("Progressive Word Hypothesis Reduction for Very Large Vocabulary 
Continuous Speech Recognition, " 1997) fails to teach that candidate first words have a number 
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of phonemes equal to or above a preset number because they allege that while Doyle teaches that 
the score for first words are boosted, there is no discussion in Doyle of a preset number of 
phonemes for candidate first and second words (Amendment, Page 9). 

In response, the examiner notes that Doyle does teach such a phoneme-based number. 
More specifically, it is pointed out that Doyle recites that a "short word" is defined as being 
measured in terms of a number of phonemes ("a missing phoneme is more costly, the fewer 
phonemes there are in the word", Section 4.11, Page 37). Doyle explains that a misrecognized 
phoneme for these acoustically unstable short words can dramatically affect the recognition of 
such a word type because of the small number of phonemes in the word leaves little room for 
speech recognition error (Section 4.11, Page 37). In order to be even considered for Doyle's 
second word type candidate a word is required to meet this "fewer phonemes" designation 
(Section 4.11, Pages 37-40), which corresponds to the claimed "the second candidate words 
having unstable acoustic characteristic values with a number of phonemes less than a preset 
value". Looking to Doyle's boosted or second word type candidates, it can be seen that all of 
these words meet the "fewer phonemes" criteria because the longest words in terms of phonemes 
are words such as "that", "said", or "five" which consist of three phonemes. Thus since Doyle 
discloses a "fewer phonemes" criteria and a short word candidate list having a phoneme 
maximum of 3, Doyle teaches second candidate words having a number of phonemes (i.e., 3 or 
less) less than a preset value (i.e., 4 or more phonemes). Conversely then in Doyle the first type 
candidate words would not be considered to be short words, feature more reliable recognition 
scores that would cause them to be likely to be picked as candidates with high recognition scores 
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(Section 4.11, Pages 37-39), and have a number of phonemes greater than or equal to the preset 
value. 

For at least the above reasons, the aforementioned applicants' arguments have been fully 
considered, but are not convincing. 

The art rejections of the dependent are traversed for reasons similar to the independent 
claims (Amendment, Page 10). In regards to such arguments, see the response directed towards 
the independent claims. 

Claim Rejections - 35 USC § 112 

4. The following is a quotation of the first paragraph of 35 U.S.C. 1 12: 

The specification shall contain a written description of the invention, and of the manner and process of making 
and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 

5. Claims 1 and 5-9 are rejected under 35 U.S.C. 1 12, first paragraph, as failing to comply 
with the written description requirement. The claim(s) contains subject matter which was not 
described in the specification in such a way as to reasonably convey to one skilled in the relevant 
art that the inventor(s), at the time the application was filed, had possession of the claimed 
invention. Amended independent claims 1 and 8-9 all recite that a first candidate words have a 
number of phonemes equal to or above the preset number of phonemes value. The examiner 
reviewed the originally filed specification for this limitation, but at best could only find support 
that the first candidate words have a "larger number of phonemes" (Specification, Page 42). The 
specification does not appear to indicate that these candidates have a number of phonemes equal 
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to the low phoneme number threshold. Thus, claims 1 and 8-9 and their associated dependent 
claims are directed to new matter. 

Claim Rejections - 35 USC §103 

6. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

7. Claim 1 and 7-8 are rejected under 35 U.S.C. 103(a) as being unpatentable over Higgins 
et al (U.S. Patent: 5,218,668) in view of Doyle ("Progressive Word Hypothesis Reduction for 
Very Large Vocabulary Continuous Speech Recognition, " 1997). 

With respect to Claims 1 and 8, Higgins discloses: 

Extraction means for extracting characteristic values of said input speech, the input 
speech comprising a plurality of input words (speech parameter extraction, Col. 5, Lines 45-63; 
and input speech corresponding to a word sequence, Col. 6, Lines 16-46); 

Selection means for selecting one or more candidate first words from the plurality of 
input words to be processed by speech recognition processing, based on a word score that 
represents an evaluation of acoustic scores and language scores calculated using said 
characteristic values (determining a first word hypothesis set based on a matching algorithm 
utilizing a keyword template, Col. 4, Lines 49-66; Col. 6, Lines 16-46; and syntax language 
models, Col. 8, Lines 18-26), and for selecting one or more candidate second words from the 
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plurality of input words based on a second measure different from said first measure 
(determining a second word hypothesis set based on a matching algorithm utilizing a filler 
template relating to keywords, Col. 4, Lines 49-66; and Col. 6, Lines 16-46); 

Score calculation means for calculating said score of said candidate first and candidate 
second words selected by said selection means referencing concatenation information of said 
first and second words (scoring a template string from a concatenation of partial strings of 
existing candidates located in a phrase buffer with current template candidates, Col. 6, Lines 16- 
46; and Col. 8, Line 9- Col. 9, Line 65); and 

Finalizing means for finalizing a words string, as the recognition result of said speech 
based on said score (finalized recognition output corresponding to a string of most likely word 
templates, Col. 6, Lines 63-67; and finalizing phrase recognition, Col. 9, Lines 26-54), wherein 
the word concatenation information is sequentially updated based on the score (accumulating 
scores for partial strings by further concatenating candidates for a current frame to the existing 
partial strings to produce an updated score, Col. 6, Lines 16-46). 

Although Higgins teaches the selection of alternative speech recognition candidates 
corresponding to smaller speech units, Higgins utilizes an acoustic distance algorithm in order to 
make such a selection and not a non-acoustic selection of candidate words having unstable 
acoustic characteristic values with a number of phonemes less than a preset value and a number 
of phonemes equal to or greater than the number for selection of first candidates. Doyle, 
however, teaches the automatic selection of candidate words having acoustically unstable 
constituents (i.e., the shortness of the word contributes to acoustic matching inaccuracy because 
missing phonemes are more costly) from a defined set of short words having a low number of 
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phonemes (i.e., the word "the" consisting of 2 phonemes and see short word list, Section 4.11, 
Pages 39-40) based on an assigned boosting amount, without which the short words would not 
be selected or missed as candidates (Section 4.11, Pages 37-40). Conversely then, in Doyle, the 
first type candidate words would not be considered to be short words, feature more reliable 
recognition scores that would cause them to be likely to be picked as candidates with high 
recognition scores (Section 4.11, Pages 37-39), and have a number of phonemes greater than or 
equal to the preset value. 

Higgins and Doyle are analogous art because they are from a similar field of endeavor in 
speech recognition. Thus, it would have been obvious to a person of ordinary skill in the art, at 
the time of invention, to modify the teachings of Higgins with the short word selection means 
taught by Doyle in order to prevent short words from being lost from consideration in a speech 
recognition process (Doyle, Section 4.11, Page 37). 

With respect to Claim 7, Higgins recites: 

The selection means calculates said score using characteristic values of the speech to 
select said first word based on said score (extracted speech parameters used in keyword template 
matching, Col. 5, Lines 45-63; and Col. 6, Lines 16-21). 

8. Claim 9 is rejected under 35 U.S.C. 103(a) as being unpatentable over Higgins et al in 
view of Doyle and further in view of Franz et al (U.S. Patent: 6,178,401). 

With respect to Claim 9, Higgins in view of Doyle teaches the speech recognition 
method, as applied to claim 8, but does not explicitly teach method implementation as a program 
stored on a computer readable medium. Franz, however, in a similar field of endeavor in speech 
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recognition, further recites implementing a speech recognition method as a program stored on a 
computer readable medium to enable method implementation on one or more general purpose 
computers (Col. 2, Lines 42-67). 

9. Claims 5-6 are rejected under 35 U.S.C. 103(a) as being unpatentable over Higgins in 
view of Doyle and further in view of Holt et al (U.S. Patent: 5,960,447). 

With respect to Claim 5, Higgins in view of Doyle teaches the speech recognition system 
utilizing keyword and alternative model matching to generate candidate hypotheses in 
recognizing an input speech sequence, as applied to claim 1 . Higgins in view of Doyle does not 
teach the use of a storage means for memorizing speech recognition results and using the results 
in a subsequent alternative recognition, however Holt discloses a means for storing a confidence 
score from a recognition engine for use in a speech recognition process (Col. 9, Lines 7-61). 

Higgins, Doyle, and Holt are analogous art because they are from a similar field of 
endeavor in speech recognition. Thus, it would have been obvious to a person of ordinary skill 
in the art, at the time of invention, to modify the teachings of Higgins in view of Doyle with the 
confidence score storage means taught by Holt in order to provide an improved alternative 
speech recognition means for editing and correcting speech recognition results (Holt, Col. 1, Line 
65- Col. 2, Line 21). 

With respect to Claim 6, Holt further recites: 

Inputting means for providing an input for correcting the results of speech recognition; 
wherein said storage means stores the results of the speech recognition corrected by the input 
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from said inputting means (editing a recognition result and updating a confidence score, Col. 9, 
Lines 36-61). 

Conclusion 

10. Applicant's amendment necessitated the new ground(s) of rejection presented in this 
Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). 
Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the date of this 
final action. 

1 1 . The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure: See PTO-892. 
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12. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to James S. Wozniak whose telephone number is (571) 272-7632. 
The examiner can normally be reached on M-Th, 7:30-5:00, F, 7:30-4, Off Alternate Fridays. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached at (571) 272-7602. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 

/James S. Wozniak/ 

Primary Examiner, Art Unit 2626 



